




Calhoun: The NPS Institutional Archive 
DSpace Repository 


Theses and Dissertations 1. Thesis and Dissertation Collection, all items 


1994 


Artificial neural network modeling of damaged aircraft 


Brunger, Clifford A. 


Monterey, California. Naval Postgraduate School 
http://hdl.handle.net/10945/42931 


This publication is a work of the U.S. Government as defined in Title 17, United 
States Code, Section 101. Copyright protection is not available for this work in the 
United States. 












AD-A283 227 


NAVAL POSTGRADUATE SCHOOL 
Monterey, California 




ARTIFICIAL NEURAL NETWORK MODELING 
OF DAMAGED AIRCRAFT 


by 
Clifford A. Brunger 


March, 1994 


Thesis Advisor: Daniel J. Collins 





Approved for public release; distribution is unlimited. 










REPORT DOCUMENTATION PAGE

Clifford A. Brunger

The views expressed in this thesis are those of the author and do not reflect the official policy or position of
the Department of Defense or the U.S. Government.


Aircraft design and control techniques rely on the proper modeling of the aircraft's equations of motion.
Many of the variables used in these equations are aerodynamic coefficients which are obtained from scale
models in wind tunnel tests. In order to model damaged aircraft, every aerodynamic coefficient must be
determined for every possible damage mechanism in every flight condition. Designing a controller for a
damaged aircraft is particularly burdensome because knowledge of the effect of each damage mechanism on the
model is required before the controller can be designed. Also, a monitoring system must be employed to decide
when and how much damage has occurred in order to reconfigure the controller. Recent advances in artificial
intelligence have made parallel distributed processors (artificial neural networks) feasible. Modeled on the
human brain, the artificial neural network's strength lies in its ability to generalize from a given model. This
thesis examines the robustness of the artificial neural network as a model for damaged aircraft.





S/N 0102-LF-014-6603 Unclassified 





Approved for public release; distribution is unlimited. 


Artificial Neural Network Modeling of Damaged Aircraft 
by 
Clifford A. Brunger 
Lieutenant, United States Navy 
B.S.E.E., United States Naval Academy, 1984 
Submitted in partial fulfillment of the 
requirements for the degree of 


MASTER OF SCIENCE IN AERONAUTICAL ENGINEERING 


from the 


NAVAL POSTGRADUATE SCHOOL 


MARCH 1994 


Author: 


Approved by: 










Richard M. Howard, Second Reader 





Daniel J. Collins, Chairman
Department of Aeronautics and Astronautics 








ABSTRACT 


Aircraft design and control techniques rely on the proper
modeling of the aircraft's equations of motion. Many of the
variables used in these equations are aerodynamic coefficients
which are obtained from scale models in wind tunnel tests. In order
to model damaged aircraft, every aerodynamic coefficient must be
determined for every possible damage mechanism in every flight
condition. Designing a controller for a damaged aircraft is
particularly burdensome because knowledge of the effect of each
damage mechanism on the model is required before the controller
can be designed. Also, a monitoring system must be employed to
decide when and how much damage has occurred in order to
reconfigure the controller. Recent advances in artificial intelligence
have made parallel distributed processors (artificial neural
networks) feasible. Modeled on the human brain, the artificial
neural network's strength lies in its ability to generalize from a
given model. This thesis examines the robustness of the artificial
neural network as a model for damaged aircraft.







TABLE OF CONTENTS

I. INTRODUCTION
II. NEURAL NETWORK FUNDAMENTALS
    A. BIOLOGICAL NEURONS
    B. PROCESSING ELEMENTS
    C. ARTIFICIAL NEURAL NETWORKS
    D. ARTIFICIAL NEURAL NETWORK OPERATION
        1. Learning
        2. Testing
        3. Learning Algorithm
    E. BACK-PROPAGATION
        1. Generalized Delta Rule
III. EXPERIMENTAL PROCEDURE
    A. HARDWARE-SOFTWARE
    B. OVERVIEW
    C. SETUP SPECIFICS
        1. Modeling of A-4D Longitudinal Motion
        2. Generation of Learn and Test Files
            a. Random Binary Sequence
            b. A-4D Learn and Test Data
            c. Damaged A-4D Learn and Test Data
        3. Neural Network Configuration
        4. Network Validation
IV.
    A. INTERPOLATION BETWEEN DAMAGE MECHANISMS
    B. EXTRAPOLATING FROM A DAMAGED AIRCRAFT
    C. DETECTING AND RESPONDING TO DAMAGE
V. CONCLUSIONS
APPENDIX A: MATLAB PROGRAMS
APPENDIX B: NEURALWORKS SETUP SPECIFICS
LIST OF REFERENCES
INITIAL DISTRIBUTION LIST







To my beautiful wife Joie, who makes life enjoyable. 











I. INTRODUCTION 


Designing modern aircraft controllers requires advance 
knowledge of aircraft dynamics. In order to determine these 
dynamics, the aircraft is usually modeled. First, the equations of 
motion (based on Newton's second law) are derived. Neglecting 
higher order derivatives and making small angle approximations, 
these equations are linearized around a trim position. The aircraft's 
equations of motion can then be written in the following state space
format:

$$\dot{x}(t) = A x(t) + B u(t)$$
$$Y(t) = C x(t) + D u(t)$$

where,
x(t) = state vector
u(t) = input vector
Y(t) = output vector


Although the previous steps can be carried out to a high degree of 
accuracy, the aerodynamicist’s job is still daunting. Using wind 
tunnel tests on scale models, the numerous partial derivatives must 
be obtained in order to compute the state space matrices (A, B, C and 
D). A simple model of an aircraft is at least eighth order, and
some models reach more than fiftieth order, so determining
the partial derivatives is an arduous task. Once the derivatives are
known, the transfer functions can be computed and, therefore, the
static and dynamic response of the aircraft can be determined.

These responses, however, are valid only around the trimmed 
position. In order for the controller to function throughout the 
aircraft's entire flight regime, several other trim positions, or 
flight conditions, must be defined, the equations of motion must be 
linearized around these new trim positions, and the new 
aerodynamic derivatives must be determined or estimated. This 
results in additional A, B, C, and D matrices. The more trim 
positions selected, the more schedules or plants the controller has 
to pick from and, therefore, the more robust the controller. 

A problem arises, however, when the aircraft is damaged. The 
controller is hard-wired with the modeled data, but that data is no 
longer valid. The aerodynamic derivatives of the aircraft have 
changed, but those used by the controller have not. A solution to this
problem is to make the controller more robust, i.e., include more 
trim positions and plant variations in its development. However, 
identifying every possible damage mechanism and its effect on every 
specific aerodynamic derivative is prohibitive. Also, the complexity
of the controller and the amount of effort required to obtain all of
the additional derivatives make this method impractical.

Recently, a new method of modeling has become feasible: 
artificial neural networks. Artificial neural networks excel in 
generalizing and extrapolating. This could make them robust models,
reducing the number of trim positions and damage mechanisms that
must be identified and properly modeled to define an aircraft's
motion throughout its flight regime. Earlier work in this area has
demonstrated the validity of artificial neural network controllers
[SCOT 89] [DROR 92]. This thesis will investigate the use of
artificial neural networks to model damaged aircraft.








II. NEURAL NETWORK FUNDAMENTALS


The computer, with its ability to carry out millions of 
instructions per second, does many things amazingly well. Its
strength lies in the ability of its microprocessor to carry out simple
arithmetic at an extremely fast rate while maintaining a high degree
of accuracy. The human brain is not as proficient at performing
simple calculations. It excels in pattern recognition and the
extrapolation of data from patterns, far surpassing the ability of
computers. The idea behind the artificial neural network is to use 
the speed and accuracy of the computer to imitate some of the 
brain's functions. It is, therefore, not coincidental that artificial 
neural networks are patterned on a crude approximation of the 
human brain. 


A. BIOLOGICAL NEURONS 

The fundamental cellular unit of the human nervous system is the 
neuron. The brain is made up of billions of interconnected neurons
forming a neural network. As shown in Figure 2.1, a neuron receives
signals from several other neurons via its dendrites or input
tentacles. The neuron processes these inputs in the cell body and, if
the combined sum of these signals is strong enough, it "fires,"
sending an output signal to other neurons' dendrites via its axon or
output tentacle. This process is not as simple as it seems because
the axon and dendrites do not connect; they intermingle at synaptic
junctions which have various strengths (or gains) depending on the
gap size and the properties of the synaptic cleft material. Also, the
signals sent are actually chemicals, of which more than thirty have
been identified [WASS 89 p. 12]. Some chemicals excite the cell
body to fire and some inhibit firing. As the chemicals (inputs from
the dendrites) mix in the cell body, a certain level of activation is
reached which causes the cell to fire. When a cell fires, it picks a
specific chemical and concentration to send out the axon, which is
released into the synaptic cleft for the next group of dendrites to
pick up.

Figure 2.1 Human Neuron

As a signal processing unit, the neuron is slow. A neuron must
wait approximately one millisecond between firings. For
comparison, a standard 33 MHz computer processes one byte of
information every 30 nanoseconds, thirty thousand times faster.
The human nervous system gets its overall speed from the extreme
parallelism of the neurons. It is estimated that in one meter of
neural pathway there are 100 billion neurons, and more than 1000
trillion interconnections [WASS 89 p. 194]. Because of this parallel
processing capability, the brain's neural network can recognize
patterns, learn from experience, see through noise and distortion,
generalize from previous examples, self-adjust, and abstract
essential characteristics much faster and much better than a
computer. In an effort to get computers to be able to do these
things, artificial neural networks have been developed.


B. PROCESSING ELEMENTS 

Figure 2.2 shows the processing element, the fundamental
component of an artificial neural network. Based on the design of
the human neuron, the processing element has inputs (X), weights
(W), a summer (Σ), an activation function (F) and an output (Y).
Mathematically:

$$Y = F(W \cdot X)$$

where,
Y = output (scalar)
F = activation function
X = input (vector)
W = gain (vector)

Figure 2.2 Processing Element


The input vector, X, is multiplied by a gain vector, W, to produce
an activation level, W·X (scalar). In other words, every input to the
processing element has a unique gain associated with it, and the sum
of the products of the inputs with their respective gains is the
activation level. This level is applied to an activation function, F, to
produce the output, Y. The activation function can be a threshold
function, which fires at a certain level, or a continuous function.
Although it can be linear, the activation function is usually nonlinear.
In fact, in order to achieve any advantage from using multiple layer
networks, the activation function must be nonlinear; otherwise,
multi-layer networks reduce to just two layers, defeating the
purpose of creating multi-layer networks [WASS 89 p. 19]. Often,
the activation function is a type of squashing function, i.e., the
output of the processing element is never allowed to exceed a
certain value. The two most prevalent types of squashing functions
are the sigmoid and hyperbolic tangent:





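$$F(x) = \frac{1}{1+e^{-x}} \qquad F(x) = \frac{e^{x}-e^{-x}}{e^{x}+e^{-x}}, \text{ respectively,}$$

with derivatives:

$$F'(x) = \frac{e^{-x}}{(1+e^{-x})^{2}} \qquad F'(x) = 1-\left(\frac{e^{x}-e^{-x}}{e^{x}+e^{-x}}\right)^{2}, \text{ respectively,}$$

where, x = excitation level (W·X).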
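As an illustration of the processing element and the two squashing functions, the following Python sketch computes Y = F(W·X) for a single element. The input and gain values are hypothetical, chosen only for the example; they do not come from this research.

```python
import math


def sigmoid(x):
    """Sigmoid squashing function, output bounded to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def processing_element(weights, inputs, activation):
    """Return Y = F(W . X) for one processing element."""
    if len(weights) != len(inputs):
        raise ValueError("weights and inputs must have the same length")
    # Activation (excitation) level: sum of each input times its own gain.
    excitation = sum(w * x for w, x in zip(weights, inputs))
    return activation(excitation)


# Hypothetical example values (assumptions, not thesis data).
X = [0.5, -1.0, 2.0]
W = [0.4, 0.3, -0.1]

y_sigmoid = processing_element(W, X, sigmoid)
y_tanh = processing_element(W, X, math.tanh)
print(y_sigmoid, y_tanh)
```

Passing the activation function as an argument keeps the summer and the squashing function separate, mirroring the structure of the processing element.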


These functions are plotted in Figures 2.3 and 2.4. The sigmoid
function is normally used when the input to the processing element
is unipolar (i.e., negative or positive values only), while the
hyperbolic tangent function should be used whenever the input is
bipolar (both negative and positive values).


Figure 2.3 Sigmoid Function and Derivative

Figure 2.4 Hyperbolic Tangent Function and Derivative


Activation functions can be thought of as the nonlinear gain of
the processing element, where the gain of the processing element is
proportional to the slope, or derivative, of the activation function
evaluated at the particular level of excitation. Since processing
elements often deal with very small as well as very large inputs,
using the sigmoid or the hyperbolic tangent as the squashing
function gives large gains for small signals and small gains for
large signals. This was Grossberg's solution to his own dilemma. A
small signal required a large gain, but since processing elements
were connected in series, if a fixed gain was chosen to properly
amplify the small signals, then when the larger signals were
applied, the downstream processing elements would saturate. In
order for the network to respond equally well to both large and
small input signals, a varying gain was needed. Grossberg found that
these squashing functions provided a proper gain over a wide range
of input levels, thus helping to prevent saturation of the artificial
neural network [WASS 89 p. 19].

The sigmoid and hyperbolic tangent squashing functions also possess an
important quality: their derivatives can be expressed in terms of
their original functions:

$$F'(x) = F(x) - F(x)^{2} \quad \text{and} \quad F'(x) = 1 - F(x)^{2}, \text{ respectively.}$$


During back-propagation (discussed later), the rate of change, or
derivative, of the output of every processing element in the
artificial neural network has to be calculated and evaluated at the
current excitation level of that processing element. Since the
sigmoid and hyperbolic tangent functions have derivatives that can
be expressed in terms of their original functions, the derivative of
the output of the individual processing elements can be directly
determined from their current outputs, i.e., $Y' = Y - Y^2$ (sigmoid) and
$Y' = 1 - Y^2$ (hyperbolic tangent), where Y is the output of the processing
element.
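The identities above can be checked numerically. The sketch below compares the slope computed from the output alone with a central finite-difference estimate of the derivative; the sample excitation levels and the step size are arbitrary choices made for this illustration.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def numerical_derivative(f, x, h=1e-6):
    """Central finite-difference estimate of f'(x)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def sigmoid_slope_from_output(y):
    """Y' = Y - Y^2, using only the element's current output."""
    return y - y * y


def tanh_slope_from_output(y):
    """Y' = 1 - Y^2, using only the element's current output."""
    return 1.0 - y * y


max_error = 0.0
for x in [-4.0, -1.0, 0.0, 0.5, 3.0]:  # arbitrary excitation levels
    err_sig = abs(sigmoid_slope_from_output(sigmoid(x)) - numerical_derivative(sigmoid, x))
    err_tanh = abs(tanh_slope_from_output(math.tanh(x)) - numerical_derivative(math.tanh, x))
    max_error = max(max_error, err_sig, err_tanh)

print("largest mismatch:", max_error)
```

The same functions also show the varying gain discussed earlier: the slope is largest at zero excitation and falls toward zero as the output approaches its limits.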





C. ARTIFICIAL NEURAL NETWORKS 

An artificial neural network consists of layers of processing
elements joined together in series. In a fully connected system, the
output of every processing element in one layer is connected to the
input of every processing element in the next layer via an amplifying
weight. Typically there is an input or buffer layer, one or two
hidden layers, and an output layer. The hidden layer is so called
because there is no direct access to it, i.e., it is not directly
connected to the input or output. Figure 2.5 shows a typical
artificial neural network with four processing elements in the input
layer, eight processing elements in the hidden layer and two
processing elements in the output layer. The network shown is fully
connected, and a weight, or gain, is associated with each connection.
Feedback loops and bypass loops are possible as well. Note that a
bias is shown. The purpose of the bias is to provide a trainable
offset to the origin of the squashing function. This results in
faster learning [HECH 89 p. 53].




















Figure 2.5 Artificial Neural Network 


D. ARTIFICIAL NEURAL NETWORK OPERATION 

An artificial neural network possesses no inherent knowledge. 
After the structure of the network is determined, the weights are 
initialized to random values as a precursor to the learning process. 
(If the weights were all initialized to the same value, they would all
be altered by the same amount and no learning would occur.) After
learning, the network must be tested for proper operation.

In a linear time invariant system, an output is obtained by
convolving the input with the impulse response of the system. In the
frequency domain, the transfer function, H(s), is the ratio of the
output to the input signal.














x(t) → h(t) → y(t)

$$y(t) = x(t) \ast h(t) \quad \text{and} \quad H(s) = \frac{Y(s)}{X(s)}$$
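For sampled signals the convolution becomes a finite sum. The sketch below is a discrete-time analogue of the relationship above, not a procedure taken from this research; the impulse response and input sequence are hypothetical.

```python
def convolve(x, h):
    """Discrete convolution y[n] = sum over k of x[k] * h[n - k]."""
    y = [0.0] * (len(x) + len(h) - 1)
    for n, x_n in enumerate(x):
        for k, h_k in enumerate(h):
            y[n + k] += x_n * h_k
    return y


# Hypothetical impulse response and input sequence (assumptions).
h = [1.0, 0.5, 0.25]
x = [1.0, 0.0, -1.0, 2.0]

y = convolve(x, h)
print(y)
```

Applying a unit impulse as the input returns the impulse response itself, which is why h fully characterizes a linear time invariant system.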


In comparison, a neural network is given an input and output
sequence and attempts to learn the mathematical relationship or
transfer function of the two sequences.

1. Learning

Learning is the process of modifying the weights of the
connections between the processing elements such that the given
input results in the desired output. There are three types of
learning: supervised, unsupervised and reinforcement. Supervised
learning, discussed below, was the type employed in this research.

a. Supervised Learning

During supervised learning, an artificial neural network 
is provided with an input sequence along with the desired output 
sequence. If the input sequence is identical to the desired output 
sequence, then the network is auto-associative. If the input 
sequence is different from the desired output sequence, as it was in 
this research, then the network is hetero-associative. 

When the first record of the input sequence is applied to
the network's buffer layer, the network computes an output based on
the randomized weight values and the network's
configuration. This output is compared to the desired output and an
error signal is generated. Using this error signal, an algorithm
alters the values of the weights to reduce the network's global error
and then the cycle repeats. If the error between the network's
output and the desired output goes to zero or an acceptably low
level as more training pairs are applied to the network, then the
network has learned the mathematical function connecting the input
data with the output data.

Usually the network is trained over a large number of
desired input/output pairs in order to capture the range and frequency
response of the data. For example, if the elevator of an aircraft is
randomly excited and the airspeed is recorded simultaneously with
elevator position, then the two measurements make up an input and
desired output pair. If the data is recorded fast enough, the
transfer function from elevator to airspeed is inherent in these two
sequences. If the sequence is long enough, and the proper excitation
was applied, then the transfer function of the aircraft's airspeed to
elevator input can be determined. If,

$$\dot{x}(t) = A x(t) + B \delta_e(t)$$
$$Y(t) = C x(t)$$

then,

$$Y(s) = C(sI - A)^{-1} B \delta_e(s)$$

where $C(sI - A)^{-1}B$ is the transfer function from $\delta_e(s)$ to $Y(s)$.
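To make the expression C(sI - A)^-1 B concrete, the sketch below evaluates it for a two-state, single-input, single-output system. The matrices are hypothetical (a lightly damped second-order system) and are not the A-4D model; the function name is likewise an assumption made for this example.

```python
def transfer_function_2x2(A, B, C, s):
    """Evaluate H(s) = C (sI - A)^-1 B for a two-state SISO system."""
    (a11, a12), (a21, a22) = A
    # Entries of M = sI - A.
    m11, m12, m21, m22 = s - a11, -a12, -a21, s - a22
    det = m11 * m22 - m12 * m21
    # v = M^-1 B, using the closed-form inverse of a 2x2 matrix.
    v1 = (m22 * B[0] - m12 * B[1]) / det
    v2 = (-m21 * B[0] + m11 * B[1]) / det
    return C[0] * v1 + C[1] * v2


# Hypothetical system with H(s) = 1 / (s^2 + s + 4) (assumption).
A = [[0.0, 1.0], [-4.0, -1.0]]
B = [0.0, 1.0]
C = [1.0, 0.0]

dc_gain = transfer_function_2x2(A, B, C, 0.0)
response_at_2 = transfer_function_2x2(A, B, C, 2.0j)  # s = j*omega, omega = 2 rad/sec
print(dc_gain, abs(response_at_2))
```

Evaluating H at s = jω over a range of frequencies gives the frequency response that a sufficiently rich input/output record contains implicitly.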













2. Testing 

Testing is required to validate a neural network's learning.
During testing the weights are fixed to those values obtained during 
the training phase, and another input sequence is applied to the 
network. The output of the neural network is compared to the 
desired output and the error is measured. If the error is sufficiently 
small, then the network has indeed learned the transfer function of 
the data. 

Normally, the testing sequence is different from the learning
sequence. This is done to test the network's ability to generalize to
data upon which it has not actually trained. On many occasions
networks have adequately learned from the training
sequences, but could not extrapolate to the test data. Since the
training sequence is not a continuous function, and since it is not
infinitely long, it does not contain all of the possible combinations
of input and desired output pairs. If the neural network has learned
the mathematical relationship expressed in the training data, and the
test data expresses that same relationship, then the network
should be able to determine the correct output associated with the
test data. Since artificial neural networks are designed to
generalize, if a network learns too well during training, it might not
be able to generalize when tested on data it has not specifically
learned. As a result, there is a trade-off between learning and
generalizing.










3. Learning Algorithm 

The learning algorithm (or learning rule) determines the
manner by which the weights are adjusted during learning. The best
method for learning, i.e., reducing the network's output error to the
smallest value, may not necessarily be the fastest, and with data
sets hundreds of thousands of lines long, processing time is an
important consideration. There is also the problem of global versus
local minima. The algorithm may find a weight matrix that
minimizes the error, but that minimum could be a local minimum
and not the global minimum. Presently, there is no known learning rule
that can determine a weight matrix which guarantees a global
error minimum for a network based on the back-propagation
technique.
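The local versus global minimum problem can be seen with a single weight. The sketch below applies plain gradient descent to a hypothetical one-dimensional error surface that has two minima; the surface, learning rate and starting points are assumptions chosen for illustration and are unrelated to the networks trained in this research.

```python
def error_surface(w):
    """Hypothetical one-weight error surface with a local and a global minimum."""
    return w ** 4 - 3.0 * w ** 2 + w


def error_slope(w):
    """Derivative of the error surface with respect to the weight."""
    return 4.0 * w ** 3 - 6.0 * w + 1.0


def gradient_descent(w, learning_rate=0.01, steps=2000):
    """Move the weight downhill along the slope for a fixed number of steps."""
    for _ in range(steps):
        w -= learning_rate * error_slope(w)
    return w


w_from_right = gradient_descent(2.0)    # settles in the local minimum
w_from_left = gradient_descent(-2.0)    # settles in the global minimum
print(w_from_right, error_surface(w_from_right))
print(w_from_left, error_surface(w_from_left))
```

Both runs stop where the slope is zero, but only one of them reaches the lowest error; which one is found depends entirely on the starting weight.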


E. BACK-PROPAGATION 

There are many different artificial neural network types, each with
several learning algorithms and each with specific advantages and
disadvantages. Only the back-propagation technique and the delta
rule algorithm, used in this research, will be discussed.

The back-propagation technique was discovered by David
Rumelhart [RUME 86] and David Parker [PARK 85], separately, as a
way of solving the credit assignment problem posed by Minsky and
Papert in Perceptrons [MINS 69]. The perceptron was the first type
of artificial neural network to gain widespread importance. It was
flawed because it was unable to solve problems that were not
linearly separable. Complex multi-layer perceptrons could be built,
but, in order to solve non-linearly separable problems, a method of
determining which processing element caused the error in the output
was needed. The back-propagation algorithm solves
this problem by assuming that all of the processing elements are
responsible for the error and, therefore, requires modification of all
of the weights in the network. The error between the desired output
and the network's computed output is fed back through the network
layers, starting at the output layer, adjusting the weights, layer by
layer, until the input layer is reached; hence the error is
"back-propagated" through the network to adjust the weights.

The typical back-propagation network has an input layer, an
output layer and one or more hidden layers. Although there is no
limit on the number of hidden layers that a network may use to
properly learn a transfer function, there is some analysis which
shows that only three hidden layers are required in order to solve
complex classification problems [NEUR A 93 p. 63].

1. Generalized Delta Rule 

Although back propagation is the method by which the
weights in the network are modified, the specific values by which
the weights are changed are determined by the learning rule. The
learning rule employed in this research was the generalized delta
rule. This rule requires that the learning be supervised.

Learning using the generalized delta rule can be broken down
into four steps. The first is the forward pass. During this step, one
record of the training sequence is applied to the network and, using
the network's configuration and initial weights, an output is
generated. Next, the error between the network's output and the
desired output is computed. This can be expressed in vector form as:

$$E = Y_d - Y_a$$

where,
$E$ = error (column vector)
$Y_d$ = desired output (column vector)
$Y_a$ = actual output (column vector)


Multiplying this error by the derivative of the squashing function
evaluated at the excitation level of the output layer gives a modified
error, or delta, for the output layer:

$$\delta_o = (I * E) * F'(W_o X_o)$$

where,
$\delta_o$ = modified error (column vector)
$I$ = identity matrix
$E$ = error (column vector)
$F'$ = derivative of the activation function (hyperbolic tangent)
$W_o$ = output layer's weight matrix
$X_o$ = input vector to output layer during forward pass

but, $F'(W_o X_o) = 1 - F(W_o X_o)^2$ for the hyperbolic tangent function; therefore,

$$\delta_o = (I * E) * [1 - F(W_o X_o)^2]$$


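The final step is the back propagation of this modified delta to
generate the change in the weight matrix:

$$\Delta W_o = \eta_o * F(W_h X_h) * \delta_o^T$$

where,
$\Delta W_o$ = change in output layer's weight matrix
$\eta_o$ = learning rate coefficient of output layer (scalar)
$F(W_h X_h)$ = output of the hidden layer, i.e., the input vector to the output layer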


To determine the weight adjustments for the hidden layer,
the modified error, $\delta_o$, is fed backwards through the weight matrix
in order to calculate a desired output for the hidden layer, $W_o^T \delta_o$. A
modified delta for the hidden layer can then be computed:

$$\delta_h = (I * W_o^T \delta_o) * F'(W_h X_h)$$

where,
$\delta_h$ = modified error of hidden layer
$W_h$ = hidden layer's weight matrix

but, $F'(W_h X_h) = 1 - F(W_h X_h)^2$ for the hyperbolic tangent function; therefore,

$$\delta_h = (I * W_o^T \delta_o) * [1 - F(W_h X_h)^2]$$








and the change in the weight matrix can then be determined as before:

$$\Delta W_h = \eta_h * F(W_i X_i) * \delta_h^T$$

where,
$\Delta W_h$ = change in hidden layer's weight matrix
$\eta_h$ = learning rate coefficient of the hidden layer (scalar)
$F(W_i X_i)$ = output of the layer feeding the hidden layer

This process is continued until the input layer is reached.
The learning rate coefficient, $\eta$, is a scaling factor to cause
different layers to learn at different rates.


...using different learning coefficients for each layer in a multi-layer 
network can decrease learning time. In particular, having a larger 
learning [rate] coefficient at the hidden layer than for the output 
layer allows the hidden layer to form feature detectors during the 
early stages of training. ... With large learning rates, a network may 
go through large oscillations during training. In fact, if the rates are 
too large, the network may never settle or converge. Smaller rates 
tend to be more stable. [NEUR C 93 p. 54] 
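The four steps of the generalized delta rule can be sketched in a few lines of Python. This is an illustrative implementation under stated assumptions, not the NeuralWorks implementation used in this research: it uses one hidden layer, hyperbolic tangent elements, a bias input of 1.0 on each layer, per-record weight updates, and hypothetical values for the layer sizes, learning rates, random seed and training patterns (the exclusive-or problem in bipolar form).

```python
import math
import random


def forward(w_hidden, w_output, x):
    """Forward pass. A constant 1.0 is appended to each layer's input as the bias."""
    x_aug = list(x) + [1.0]
    hidden = [math.tanh(sum(w * v for w, v in zip(row, x_aug))) for row in w_hidden]
    h_aug = hidden + [1.0]
    output = [math.tanh(sum(w * v for w, v in zip(row, h_aug))) for row in w_output]
    return x_aug, hidden, h_aug, output


def weight_changes(w_hidden, w_output, x, desired, eta_hidden, eta_output):
    """Return (dW_hidden, dW_output, error) for one training record."""
    x_aug, hidden, h_aug, output = forward(w_hidden, w_output, x)
    error = [d - a for d, a in zip(desired, output)]              # E = Yd - Ya
    delta_o = [e * (1.0 - y * y) for e, y in zip(error, output)]  # tanh slope = 1 - Y^2
    # Feed delta_o back through the output weights (bias column excluded).
    delta_h = [
        sum(w_output[k][j] * delta_o[k] for k in range(len(delta_o))) * (1.0 - hidden[j] ** 2)
        for j in range(len(hidden))
    ]
    dw_output = [[eta_output * d * v for v in h_aug] for d in delta_o]
    dw_hidden = [[eta_hidden * d * v for v in x_aug] for d in delta_h]
    return dw_hidden, dw_output, error


def apply_changes(weights, changes):
    """Add the computed changes to the weights in place."""
    for row, change_row in zip(weights, changes):
        for i, change in enumerate(change_row):
            row[i] += change


def train(patterns, n_hidden=4, epochs=2000, eta_hidden=0.3, eta_output=0.15, seed=1):
    """Train on the patterns; returns the weights and the squared error per epoch."""
    rng = random.Random(seed)
    n_in, n_out = len(patterns[0][0]), len(patterns[0][1])
    w_hidden = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)] for _ in range(n_hidden)]
    w_output = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)] for _ in range(n_out)]
    history = []
    for _ in range(epochs):
        squared_error = 0.0
        for x, desired in patterns:
            dw_h, dw_o, error = weight_changes(w_hidden, w_output, x, desired, eta_hidden, eta_output)
            apply_changes(w_hidden, dw_h)
            apply_changes(w_output, dw_o)
            squared_error += sum(e * e for e in error)
        history.append(squared_error)
    return w_hidden, w_output, history


# Hypothetical training set: exclusive-or with bipolar inputs and targets.
XOR = [([-1.0, -1.0], [-0.9]), ([-1.0, 1.0], [0.9]), ([1.0, -1.0], [0.9]), ([1.0, 1.0], [-0.9])]
w_hidden, w_output, history = train(XOR)
print("first epoch error:", history[0], "last epoch error:", history[-1])
```

The hidden layer is given the larger learning rate, following the guidance quoted above. With unit learning rates the weight changes equal the negative gradient of half the squared error, which offers a simple way to check such an implementation.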










III. EXPERIMENTAL PROCEDURE


A. HARDWARE-SOFTWARE 

This experiment was carried out on a Sun SPARC2 workstation 
(15 MIPS) with 64 megabytes of RAM and 2.2 gigabytes of hard disk 
space. The software used for the design, learning and testing of the 
neural network was NeuralWorks II (version 5.0). MATLAB (version 
4.1) was used extensively to create data files and to correlate and 
plot output data. 

Even though the Sun workstation is considered a fast computer,
some data runs required over an hour to complete. Since the
NeuralWorks program is attempting to create a parallel processor
using a single micro-processor, it requires a significant amount of
RAM. In order to reduce run time, data files were converted into
binary and loaded into RAM by the NeuralWorks program. Also, since
the data files for the neural network were ten megabytes in size, a
large external storage capacity was needed. These requirements justified
the use of a UNIX machine with such a large RAM and disk storage capacity.

NeuralWorks is a software program which simulates a parallel 
processor (hardware). It is a design tool which can be used to 
determine the size and configuration of an artificial neural network 
and can be used to extensively test and modify the network design 
before the network is constructed. Since the program is using a
single processor to model multiple parallel processors, the software
version of the network runs much slower than an actual parallel
processor. The data from NeuralWorks can be downloaded in order to
physically build the designed artificial neural network.


B. OVERVIEW 

The artificial neural network’s training and testing files were 
generated using MATLAB. This was accomplished by creating a 
random binary signal and using it to excite a digitized state space 
model of an A-4D. The output values of the state space model of the 
A-4D [u, alpha, q, and theta], were then organized and recorded in 
data files. Using NeuralWorks, the training data file was presented 
to the artificial neural network until satisfactory learning occurred. 
Next, a test file was generated in the same manner and was 
presented to verify the learning. The artificial neural network's
output was recorded and MATLAB was used again to perform an 
analysis of the test data. 


C. SETUP SPECIFICS 
1. Modeling of A-4D Longitudinal Motion 

The A-4D was chosen as the investigation platform for
artificial neural network modeling. This aircraft is a US Navy
attack aircraft weighing approximately 17,600 lbs. The aircraft
can be modeled in state space as an eighth order system. This model
can be linearly separated into two fourth order modes: longitudinal
and lateral. After considering the complexity of the problem as well
as the memory limitations of the available computer system and
processing time, it was determined that an analysis of just the
longitudinal mode would be adequate. Using the data from Aircraft
Dynamics and Automatic Control [MCRU 93], flight condition 8 (400
kts, 35,000 ft), the state space representation of the A-4D
longitudinal mode was created using MATLAB.


$$\dot{x}(t) = A x(t) + B u(t)$$
$$Y(t) = C x(t)$$

where,
x, Xa ‘ -g*cos (8) 
Uy Uo 
u,*Z, Z; uy +Z, —g*cos (0) 
A= Uy ~ Zy Uy — Ze Uy — Leg Uy ~ Ze 
M.,, u, + Z, 
4M, +e ae M, + Mata. Sig ee ae a) 0 
Uy —- Ze Uo ~Z, Ug - 2a 
0 0 
Xe 
Uy 
Lie 
B= Uy — Ly 
My, + eo ob 
Uy ~Zy 
0 
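$$C = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}$$

$$x = \begin{bmatrix} u \\ \alpha \\ q \\ \theta \end{bmatrix} = \begin{bmatrix} \text{airspeed perturbation} \\ \text{angle of attack perturbation} \\ \text{pitch rate perturbation} \\ \text{pitch angle perturbation} \end{bmatrix}$$

$$U = \delta_e = \text{elevator deflection}$$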


The Bode diagrams in Figure 3.1 show the dominance of the
two primary modes of the A-4D longitudinal dynamics.

Figure 3.1 Bode Plot of A-4D (panels include Bode plots of u, q, and theta versus frequency in rad/sec)


The short period mode occurs at a frequency of 2.77 rad/sec 
and the long period or phugoid mode occurs at 0.075 rad/sec. Low 
frequencies dominate the response in airspeed and pitch angle while 
high frequencies dominate the response in angle of attack and pitch 
rate. The eigenvalues of the plant are summarized in Table 3.1. 
Notice that all of the eigenvalues are stable. Although it is possible
to model an unstable plant with an artificial neural network, doing
so would only add another level of complexity to the artificial
neural network model.


TABLE 3.1
EIGENVALUES OF A-4D

| Eigenvalues        | Damping | Freq. (rad/sec) | Period (sec) |
| -0.6245 + 2.6988i  | 0.2254  | 2.7701          | 2.3          |
| -0.6245 - 2.6988i  | 0.2254  | 2.7701          | 2.3          |
| -0.0088 + 0.0747i  | 0.1177  | 0.0752          | 83.5         |
| -0.0088 - 0.0747i  | 0.1177  | 0.0752          | 83.5         |
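
The damping and frequency columns of Table 3.1 follow directly from the eigenvalues. The sketch below is only an illustration of that arithmetic, not the thesis's MATLAB `damp` call; the function name `mode_properties` is hypothetical, and the period is assumed here to be 2π divided by the natural frequency.

```python
import math


def mode_properties(eigenvalue):
    """Return (natural frequency, damping ratio, period) for one complex eigenvalue."""
    natural_freq = abs(eigenvalue)               # rad/sec
    damping = -eigenvalue.real / natural_freq    # dimensionless
    period = 2.0 * math.pi / natural_freq        # sec (assumed definition)
    return natural_freq, damping, period


# Eigenvalues as listed in Table 3.1 (one of each conjugate pair).
short_period = mode_properties(complex(-0.6245, 2.6988))
phugoid = mode_properties(complex(-0.0088, 0.0747))
```

Because the table's eigenvalues are rounded to four decimal places, the recomputed phugoid damping differs from the tabulated value in the fourth decimal place.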


Using MATLAB, the state space matrices were digitized with
a sampling frequency of 50 Hz, 10 times the highest frequency of
interest and two decades above the short period mode, thereby
satisfying the Nyquist criterion. The digitized matrices were then
inserted into the MATLAB Simulink diagram shown in Figure 3.2 in
order to compute the output state values. The correlation between
the analog and digital A-4 plant is shown in Figure 3.3. Notice that
the Bode plots of the two plants are indistinguishable (they appear
as a single line) and are, for all practical purposes, identical.

MATLAB files A4_cond3.m, A4_matrix.m and A4_bode.m in
Appendix A were used to model the A-4D.
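
As a rough check on the sampling-rate choice, the mode frequencies quoted in the text can be converted from rad/sec to Hz and compared with the 50 Hz rate. This is a minimal sketch; the constant and function names are illustrative and do not come from the thesis code.

```python
import math

SHORT_PERIOD_RAD_S = 2.77            # short period mode, from the text
PHUGOID_RAD_S = 0.075                # phugoid mode, from the text
HIGHEST_FREQ_OF_INTEREST_HZ = 5.0    # upper edge of the band of interest
SAMPLE_RATE_HZ = 50.0


def rad_per_sec_to_hz(omega):
    """Convert an angular frequency in rad/sec to Hz."""
    return omega / (2.0 * math.pi)


short_period_hz = rad_per_sec_to_hz(SHORT_PERIOD_RAD_S)
phugoid_hz = rad_per_sec_to_hz(PHUGOID_RAD_S)
nyquist_hz = SAMPLE_RATE_HZ / 2.0
decades_above_short_period = math.log10(SAMPLE_RATE_HZ / short_period_hz)
```

With these numbers the Nyquist frequency (25 Hz) sits well above the 5 Hz band edge, and the sampling rate is about two decades above the short period mode.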


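[Figure: Simulink block diagram. The input, [t, elevator position], drives the
"Discrete A-4D State Space" block, x(n+1)=Ax(n)+Bu(n), y(n)=Cx(n)+Du(n), whose
labeled outputs include airspeed, pitch rate, and pitch angle.]

Figure 3.2 Simulink Representation of A-4D.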














[Figure: Bode plot panels of u, alpha, q, and theta for the continuous and digitized plants; frequency axis in rad/sec]

Figure 3.3 Comparison of Continuous and Digital Plant


2. Generation of Learn and Test Files 
a. Random Binary Sequence 

A random binary sequence, [+1,-1], was constructed to 
excite the digitized A-4D model. The primary consideration in its 
design was its power spectral density or frequency content. Since 
there is a frequency band of interest for the A-4D, the random 
binary’s power spectral density was designed to match it. This 
keeps the energy of the random binary signal in the frequency band 
of interest and suppresses the aliasing from higher frequencies. 



















Since the short period mode of the A-4 occurs at 0.45 Hz and
the phugoid mode occurs at 0.012 Hz, the random binary sequence
was designed to have a frequency content from 0.001 Hz to 5 Hz, i.e.,
from one decade below to one decade above the two primary modes.
The period of the lowest frequency was used to determine the time
length of the sequence, i.e., 1000 seconds, one complete cycle of
0.001 Hz. Since a random binary sequence in the time domain
(triangular) transforms to a sinc function, sin(x)/x, in the frequency
domain, the frequency of the random binary sequence corresponds to
the first cross-over of its sinc function. In order to get a bandwidth
of 5 Hz, the cross-over was chosen to be twice that, or 10 Hz.
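
The relation between the cross-over frequency and the usable bandwidth can be checked numerically. The sketch below assumes an idealized random binary signal held for T = 0.1 seconds per value, whose power spectrum is proportional to sinc squared of fT; that model and the helper names are assumptions made for illustration, not taken from the thesis code.

```python
import math


def sinc_squared(x):
    """Normalized sinc squared, (sin(pi x) / (pi x))^2, equal to 1 at x = 0."""
    if x == 0.0:
        return 1.0
    return (math.sin(math.pi * x) / (math.pi * x)) ** 2


def half_power_point(lo=0.0, hi=1.0, tol=1e-9):
    """Bisection for sinc_squared(x) = 0.5 on (0, 1), where the function decreases."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if sinc_squared(mid) > 0.5:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


HOLD_TIME_S = 0.1                                     # one new random value every 0.1 sec
first_null_hz = 1.0 / HOLD_TIME_S                     # first cross-over of the sinc
half_power_hz = half_power_point() / HOLD_TIME_S      # -3 dB point of the idealized spectrum
```

Under this idealization the first null falls at 10 Hz and the half-power point at roughly 4.4 Hz, which is in line with the approximate 5 Hz bandwidth quoted in the text.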

Since the digitized model of the A-4D was created using
a sampling rate of 50 Hz, an adjustment to the random binary signal
was needed, i.e., instead of one random number every 0.1 seconds,
each number was repeated for 5 consecutive samples of 0.02 seconds
before a new one was selected. This maintained the same designed random
binary frequency and power spectrum. The first three seconds of the
1000 second long random binary signal are shown in Figure 3.4, and
the power spectral density plot of the entire sequence is shown in
Figure 3.5.
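
The hold-and-repeat construction described above can be sketched as follows. This is an illustrative Python version, not the thesis's Rand_seq.m; the function name, its parameters, and the seed values are hypothetical.

```python
import random


def random_binary_sequence(duration_s=1000.0, switch_rate_hz=10.0,
                           sample_rate_hz=50.0, seed=0):
    """Return a list of +1/-1 samples, each random value held for several samples."""
    rng = random.Random(seed)
    hold = int(round(sample_rate_hz / switch_rate_hz))    # 5 samples per value
    n_values = int(round(duration_s * switch_rate_hz))    # 10,000 random values
    samples = []
    for _ in range(n_values):
        level = rng.choice((-1, 1))
        samples.extend([level] * hold)                    # repeat the same value
    return samples


# Different seeds give independent learn and test excitation signals.
learn_input = random_binary_sequence(seed=1)
test_input = random_binary_sequence(seed=2)
```

Changing only the seed mirrors the way the learn and test files are described as differing later in this chapter.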




























[Figure: power spectral density versus frequency (Hz)]

Figure 3.5 Power Spectral Density of Random Binary Sequence


Notice that the energy in the signal rolls off (-3 dB) at 5
Hz and the first cross-over frequency is at 10 Hz. There are two
other notable observations to be gained from the power spectral
density plot. First, the plot is not smooth like a sinc function, but
rather noisy. This is primarily due to the signal not being a
continuous function, but also due to the randomness of the signal and
the size of the Hamming window (25000) selected for the power
spectral analysis. Secondly, there is no defined DC (zero frequency)
gain. This is because the sequence is finite in length (time).





Since this signal will eventually be applied to a neural
network, it is important to realize that the 1000 second (16.67
minute) time frame is 50,000 points long. Also, during the 1000
seconds, one would expect the 0.001 Hz signal to be represented only
once, while the 5 Hz signal would appear 5000 times. This fact
becomes important when comparing the results of the neural
network.
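
The counts quoted above are simple products of frequency and record length. A small sketch of the arithmetic, with illustrative names:

```python
RECORD_LENGTH_S = 1000.0
SAMPLE_RATE_HZ = 50.0


def cycles_in_record(freq_hz, length_s=RECORD_LENGTH_S):
    """Number of complete cycles of a sinusoid of freq_hz within the record."""
    return freq_hz * length_s


n_points = int(RECORD_LENGTH_S * SAMPLE_RATE_HZ)   # samples in the record
low_freq_cycles = cycles_in_record(0.001)          # lowest frequency of interest
high_freq_cycles = cycles_in_record(5.0)           # highest frequency of interest
```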

MATLAB files Rand_seq.m, Rand_plot.m and 
Rand_short_seq.m in Appendix A were used to generate the random 
binary signal. 

b. A-4D Learn and Test Data 

Previous research has been done using a user 
input/output program for the generation of data for an artificial 
neural network [SCOT 89] [DROR 92]. The user input/output program 
was capable of generating a time varying model of an aircraft, 
providing the neural network an infinite amount of learning if 
necessary. In order to provide a finite data set upon which to train 
and test the neural network, data files were used. As mentioned 
earlier, these files represent 1000 seconds of test data which came 
from a MATLAB A-4D model. Flight test data or wind tunnel data 
could just as easily have been used. 

The learning and testing data used by the neural network
for modeling the non-damaged A-4D was created by running the
MATLAB program Gen_A4_data.m (Appendix A). This program loads
the A-4D data, creates the digitized state space matrix, calls up the
random binary sequence, runs a simulation to generate the measured
(sensor) outputs, organizes the data for the neural network, and
stores the data in an ASCII file (Learn.nna). It then changes the seed
value of the random binary sequence, and repeats the steps above to
create the test file (Test.nna). These files consist of 50,000 rows
and 24 columns and require approximately 10 megabytes of disk
storage space each. The program also compares the two files,
determines the maximum and minimum value of each file and saves
these values in a third file. This file is considerably smaller than
the other two. Even though NeuralWorks can generate its own
minimum and maximum values from the data, the learn and test files
created were too large for the NeuralWorks software to do so
automatically.
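
The minimum and maximum bookkeeping described above can be sketched as below. The function names are hypothetical, the tiny tables stand in for Learn.nna and Test.nna, and the scaling range of -1 to +1 is an assumption; the actual NeuralWorks min/max file format is not shown in this section.

```python
def column_min_max(*tables):
    """Return (mins, maxs) per column over all rows of all given tables."""
    n_cols = len(tables[0][0])
    mins = [float("inf")] * n_cols
    maxs = [float("-inf")] * n_cols
    for table in tables:
        for row in table:
            for j, value in enumerate(row):
                mins[j] = min(mins[j], value)
                maxs[j] = max(maxs[j], value)
    return mins, maxs


def scale_row(row, mins, maxs, lo=-1.0, hi=1.0):
    """Linearly map each column value from [min, max] to [lo, hi] (assumed range)."""
    scaled = []
    for value, cmin, cmax in zip(row, mins, maxs):
        if cmax == cmin:
            scaled.append(0.5 * (lo + hi))   # constant column: use the midpoint
        else:
            scaled.append(lo + (hi - lo) * (value - cmin) / (cmax - cmin))
    return scaled


# Toy stand-ins for the learn and test files (3 columns instead of 24).
learn_rows = [[0.0, 2.0, -1.0], [1.0, 4.0, 3.0]]
test_rows = [[-2.0, 3.0, 0.0], [0.5, 6.0, 1.0]]
mins, maxs = column_min_max(learn_rows, test_rows)
scaled = scale_row([-2.0, 6.0, 1.0], mins, maxs)
```

Taking the extremes over both files, as the text describes, keeps the test data inside the range the network was scaled for.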
c. Damaged A-4D Learn and Test Data 

The MATLAB program Gen_damaged_data.m (Appendix A)
computes the damaged A-4D data. This program is the same as the
one used to generate the non-damaged A-4D data, except that, to
simulate a damaged A-4D, the aerodynamic derivative Mδe, the
change in pitching moment due to a change in elevator deflection,
was arbitrarily reduced by 70 percent. Figure 3.6 depicts the Bode
plot of the damaged A-4D model.







[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.6 Bode Plot of Damaged A-4D


Comparing this response to the one in Figure 3.3, certain
commonalities are apparent. First, the two primary modes occur at
the same frequency. This is because Mδe only appears in the B
matrix, which does not affect the eigenvalues of the plant.
Secondly, the plots have the same general shape. The only difference
is the amplitude of the peaks. Although the amplitude shift appears
to be linear, it is not. From state space:

$$
\dot{x}(t) = A\,x(t) + B\,u(t)
$$

$$
y(t) = C\,x(t)
$$

the transfer function is

$$
C\,(sI - A)^{-1}B.
$$

A linear change in the B matrix results in a linear shift in the
amplitude of the Bode plot, but changing Mδe does not change the B
matrix linearly:

$$
B = \begin{bmatrix}
\dfrac{X_{\delta_e}}{U_0} &
\dfrac{Z_{\delta_e}}{U_0 - Z_{\dot\alpha}} &
M_{\delta_e} + \dfrac{M_{\dot\alpha} Z_{\delta_e}}{U_0 - Z_{\dot\alpha}} &
0
\end{bmatrix}^{T}
$$
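
To make the point concrete, the B vector can be evaluated with the flight condition values listed in A4_cond3.m (Appendix A), once with the original Mδe and once with Mδe reduced by 70 percent. This is only an illustrative sketch; the Python names are hypothetical and the expression for B follows the A4_matrix.m listing.

```python
# Dimensional derivatives from A4_cond3.m (Appendix A).
U = 681.0
Zadot = -0.613
Xd = 0.0
Zd = -33.3
Madot = -0.206
Md = -11.33


def b_vector(md):
    """Longitudinal B vector for a given elevator pitching moment derivative."""
    kk = U - Zadot
    return [Xd / U, Zd / kk, md + Madot * Zd / kk, 0.0]


b_undamaged = b_vector(Md)
b_damaged = b_vector(0.3 * Md)               # Md reduced by 70 percent
ratio = b_damaged[2] / b_undamaged[2]        # scaling of the pitching moment entry
```

Only the third entry changes, and because of the constant Madot*Zd/kk term it scales by about 0.2994 rather than exactly 0.3, so B as a whole is not simply multiplied by a constant.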








Mδe was chosen as the damage mechanism to model
because of its connection to a real-world damage mechanism; a
reduction in Mδe is akin to a partial loss of elevator surface area due
to battle damage. Other models using different values of Mδe would
represent different amounts of damage to the elevator. As one
might expect, a reduction in Mδe is evident in all four of the
longitudinal states (see Figure 3.7: the solid line represents the
undamaged aircraft, the dotted line represents Mδe reduced 30
percent and the dash-dot line represents Mδe reduced 70 percent).


[Figure: step response panels including angle of attack, pitch rate, and pitch angle; time axis in sec]

Figure 3.7 Step Response with Original Mδe and with Mδe Reduced
30 and 70 Percent











3. Neural Network Configuration 

The neural network model for the A-4D is depicted in Figure
3.8. It was built using the “instaNet” menu in NeuralWorks. The
model has 20 processing elements in the input layer, one hidden
layer with 20 processing elements and an output layer with 4
processing elements. The specifics of the NeuralWorks setup are
provided in Appendix B.
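
The 20-20-4 layout can be sketched as a plain feedforward pass. This is not the NeuralWorks implementation: the hyperbolic tangent hidden layer, the linear output layer, the bias terms, and the random initial weights are all assumptions made for illustration, since the actual settings are given in Appendix B, which is not reproduced here.

```python
import math
import random


def make_layer(n_in, n_out, rng):
    """Return (weights, biases) with small random values; weights is n_out x n_in."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [rng.uniform(-0.1, 0.1) for _ in range(n_out)]
    return weights, biases


def apply_layer(layer, inputs, activation):
    """Weighted sum plus bias for each processing element, then the activation."""
    weights, biases = layer
    return [activation(sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]


rng = random.Random(0)
hidden_layer = make_layer(20, 20, rng)   # 20 inputs -> 20 hidden elements
output_layer = make_layer(20, 4, rng)    # 20 hidden elements -> 4 outputs


def network(inputs):
    hidden = apply_layer(hidden_layer, inputs, math.tanh)   # assumed squashing function
    return apply_layer(output_layer, hidden, lambda v: v)   # assumed linear output


n_weights = 20 * 20 + 20 * 4    # connection weights
n_biases = 20 + 4               # one bias per hidden and output element (assumed)
outputs = network([0.0] * 20)
```

Under these assumptions the network has 480 connection weights, which gives a sense of how small the model is relative to the 50,000-record training file.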

Since the longitudinal A-4D digital plant is of fourth order,
four consecutive values of all of the inputs and outputs were used
for learning. These inputs and outputs are listed in Table 3.2.
Although a fourth order artificial neural network was used to model
a fourth order plant, this is not a requirement. Dror obtained very
good results modeling a 25th order F/A-18 with a third order neural
network [DROR 92].


TABLE 3.2
INPUTS AND OUTPUTS FOR THE NEURAL NETWORK

| Inputs                                        | Outputs |
| δe(n)   | u(n)   | q(n)   | θ(n)   | α(n)   | u       |
| δe(n-1) | u(n-1) | q(n-1) | θ(n-1) | α(n-1) | q       |
| δe(n-2) | u(n-2) | q(n-2) | θ(n-2) | α(n-2) | θ       |
| δe(n-3) | u(n-3) | q(n-3) | θ(n-3) | α(n-3) | α       |
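
One way to organize time histories into the 24-column records (20 inputs followed by 4 desired outputs) is sketched below. The column ordering and the choice of the next-step states as the desired outputs are assumptions made for illustration; Gen_A4_data.m is not reproduced in this section, and the function name is hypothetical.

```python
def build_records(delta_e, u, q, theta, alpha, n_lags=4):
    """Return rows of 5*n_lags lagged inputs followed by 4 target values."""
    signals = (delta_e, u, q, theta, alpha)
    states = (u, q, theta, alpha)
    records = []
    for n in range(n_lags - 1, len(delta_e) - 1):
        # Inputs: values at n, n-1, ..., n-(n_lags-1) for each of the five signals.
        inputs = [s[n - k] for k in range(n_lags) for s in signals]
        # Targets: the four states one step ahead (assumed convention).
        targets = [s[n + 1] for s in states]
        records.append(inputs + targets)
    return records


# Toy time histories, ten samples each, chosen so columns are easy to recognize.
delta_e = [float(i) for i in range(10)]
u = [10.0 + i for i in range(10)]
q = [20.0 + i for i in range(10)]
theta = [30.0 + i for i in range(10)]
alpha = [40.0 + i for i in range(10)]
records = build_records(delta_e, u, q, theta, alpha)
```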


No attempt was made to minimize the number of processing elements in
the hidden layer. Pruning the network may have
led to faster learn times, but this was not the focus of the thesis.













[Figure: NeuralWorks network diagram]

Figure 3.8 Artificial Neural Network of A-4D








4. Network Validation 


After the artificial neural network has been fabricated, it
must be trained. The first question to be answered is how much
training is necessary. Under-training a network results in an
incorrect output while over-training a network reduces the
network's ability to generalize.

First, six identical artificial networks were built. The first
was trained on one pass of the learning file (50,000 records), the
second on two passes (100,000 records), the third on three passes
(150,000 records), the fourth on four passes (200,000 records), the
fifth on five passes (250,000 records) and the sixth on ten passes
(500,000 records). These were all tested with the same test file,
and a power spectral density analysis was performed on the results.
Figure 3.9 through Figure 3.14 show these results plotted along with
the Bode plot of the correct response. Although no over-training can
be observed, it is evident from these plots that the network has
learned the transfer function and that there is only slight
improvement between training on 4 passes (200,000 presentations)
and training on 10 passes (500,000 presentations). Since the goal
was to find the number of passes which adequately trained the
network without over-training, 4 passes (200,000 presentations)
was considered adequate. The slight improvement in performance
using 10 passes did not outweigh the increase in computer run time
required.








Also notice that the low frequency response of the artificial
neural network is not as smooth as the high frequency response.
This is due to the fact that the test data applied to the network
corresponds to 1000 seconds of time. In 1000 seconds, a 0.001 Hz
signal is represented only once while a 10 Hz signal is represented
10,000 times; therefore, the network is trained ten thousand times
more on a 10 Hz signal than on a 0.001 Hz signal. Low frequency
components are presented to the network fewer times than high
frequency components.


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.9 Network Training (one pass)


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.10 Network Training (two passes)


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.11 Network Training (three passes)


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.12 Network Training (four passes)


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.13 Network Training (five passes)


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.14 Network Training (ten passes)








After determining the number of training passes required,
one of the artificial neural networks was reinitialized and the
damaged A-4 data was presented. Again, the network was trained on
4 passes of the data (200,000 presentations). Figure 3.15 shows the
result. Notice that the network has learned the damaged plant. As
expected, the Bode plots are similar in shape and shifted in
amplitude.


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.15 Frequency Response of Network Trained on Damaged
A-4D








Next, both the damaged and undamaged training files were
combined to see if the network could learn the transfer functions of
the two plants simultaneously. It had been pointed out in earlier
work [DROR 92] that to do this the files had to be intermingled. As a
result, the two files were intermixed every 20 records, i.e., 20
damaged, 20 undamaged, 20 damaged, etc. (see Gen_both_data.m in
Appendix A). The network was then tested on the test file for the
undamaged aircraft and on the test file for the damaged aircraft.
Figure 3.16 is the spectral density of the network's output using the
test file for the undamaged A-4D and Figure 3.17 is the spectral
density of the network’s output using the test file for the damaged
A-4D.

Notice that they correspond very well to their respective
Bode plots. The network has simultaneously learned eight transfer
functions: four for the damaged A-4D and four for the undamaged
A-4D.
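
The intermixing of the two training files in blocks of 20 records can be sketched as follows. This is an illustrative Python version, not Gen_both_data.m; the function name is hypothetical and the tagged tuples stand in for real records.

```python
def interleave_blocks(first, second, block=20):
    """Alternate blocks of `block` records from two record lists."""
    merged = []
    for start in range(0, max(len(first), len(second)), block):
        merged.extend(first[start:start + block])
        merged.extend(second[start:start + block])
    return merged


# Toy records tagged by origin so the ordering is easy to inspect.
damaged = [("D", i) for i in range(60)]
undamaged = [("U", i) for i in range(60)]
mixed = interleave_blocks(damaged, undamaged)
```

Alternating short blocks keeps both plants in front of the network throughout training, rather than presenting one plant in full before the other.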


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.16 Network Trained on Damaged and Undamaged A-4D;
Tested with Undamaged A-4D


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 3.17 Network Trained on Damaged and Undamaged A-4D;
Tested with Damaged A-4D














IV. RESULTS 


Since the purpose of this thesis was to determine the robustness
of the artificial neural network as a model for damaged aircraft,
three concepts were evaluated--interpolating, extrapolating and
detecting.


A. INTERPOLATION BETWEEN DAMAGE MECHANISMS 

With the neural network designed and trained on two plants--the
damaged A-4D and undamaged A-4D--the network was tested on
other damaged A-4D data to determine how it responded when the
damage to the elevator was less than that for which it was trained.
In order to investigate this, six damaged A-4D models were created
to provide the test data for the artificial neural network using 10,
20, 30, 40, 50, and 60 percent reductions in Mδe, respectively.

Figure 4.1 is the Bode plot of the undamaged A-4 plotted along with
the power spectral density of the neural network’s output. Figure
4.2 through Figure 4.8 show the Bode plots of the respective
damaged A-4D models along with the power spectral density of the
artificial neural network’s output.

The spectral density plots indicate that the neural network has
interpolated the transfer functions of all of the damaged A-4D
models from the two models on which it was trained. While having
been trained on only one value of damage, Mδe reduced 70 percent, the
network has determined the transfer function of all of the damaged
plants and, therefore, does not need to be trained on them.


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.1 Network Trained on 70% Damage and Undamaged A-4D;
Tested with Undamaged A-4D


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.2 Network Trained on 70% Damage and Undamaged A-4D;
Tested with 10% Damage


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.3 Network Trained on 70% Damage and Undamaged A-4D;
Tested with 20% Damage


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.4 Network Trained on 70% Damage and Undamaged A-4D;
Tested with 30% Damage


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.5 Network Trained on 70% Damage and Undamaged A-4D;
Tested with 40% Damage


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.6 Network Trained on 70% Damage and Undamaged A-4D;
Tested with 50% Damage


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.7 Network Trained on 70% Damage and Undamaged A-4D;
Tested with 60% Damage


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.8 Network Trained on 70% Damage and Undamaged A-4D;
Tested with 70% Damage








B. EXTRAPOLATING FROM A DAMAGED AIRCRAFT 

Since the artificial neural network could interpolate between the
two transfer functions on which it was trained, it was tested to see
how it responded when the damage to the elevator was more than
that for which it was trained. In order to investigate this, two
damaged A-4D MATLAB models were created to provide the test data
for the artificial neural network: Mδe reduced 80 percent and 90
percent, respectively. Figures 4.9 and 4.10 show the power spectral
density plots of the neural network’s outputs along with the outputs
of the test models. From Figure 4.9 it is evident that the network
has extrapolated the correct transfer function. Figure 4.10,
however, shows the extrapolation beginning to break down slightly in
pitch rate. This is probably due to the extreme reduction in Mδe, i.e.,
the elevator has been reduced in size by roughly 90 percent. This
could be because there is not enough elevator surface area left to
effectively control the pitching moment.









[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.9 Network Trained on 70% Damage and Undamaged A-4D;
Tested with 80% Damage


[Figure: Bode plot panels of u, alpha, q, and theta; frequency axis in rad/sec]

Figure 4.10 Network Trained on 70% Damage and Undamaged A-4D;
Tested with 90% Damage


C. DETECTING AND RESPONDING TO DAMAGE 

In the previous two sections, the neural network’s output was
analyzed in the frequency domain to determine whether the network
could interpolate and extrapolate. Here, the time response of the
network is analyzed to confirm those earlier results. In Figure 4.11,
10 seconds of the neural network's output [u, q, alpha, theta] for the
undamaged A-4D is plotted along with the true A-4D’s output. Note
that the network tracks the true output almost exactly.








[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; time axis in sec]

Figure 4.11 Network Trained on 70% Damage and Undamaged A-4D;
10 Seconds of Undamaged Response













In Figure 4.12 through Figure 4.20 the output of the artificial
neural network is again plotted, except at t=5 seconds, the elevator
is simulated to have been damaged, i.e., Mδe is reduced. This was
accomplished by switching the test data from the undamaged A-4D
model to one of the damaged models. Plotted along with the neural
network’s responses are the desired responses provided by the
respective aircraft models. From these plots it is evident that the
neural network has not only identified that damage has occurred, but
it has also correctly modeled the aircraft's response to the damage.





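[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; amplitude versus time (sec)]

Figure 4.12 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 10% at t=5


[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; amplitude versus time (sec)]

Figure 4.13 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 20% at t=5


[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; amplitude versus time (sec)]

Figure 4.14 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 30% at t=5


[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; amplitude versus time (sec)]

Figure 4.15 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 40% at t=5


[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; amplitude versus time (sec)]

Figure 4.16 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 50% at t=5


[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; amplitude versus time (sec)]

Figure 4.17 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 60% at t=5


[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; amplitude versus time (sec)]

Figure 4.18 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 70% at t=5


[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; amplitude versus time (sec)]

Figure 4.19 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 80% at t=5


[Figure: time response panels of airspeed, angle of attack, pitch rate, and pitch angle; amplitude versus time (sec)]

Figure 4.20 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 90% at t=5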





In Figure 4.21, an expanded section of the time plot with Mδe
reduced 60% is shown. Notice that the artificial neural network
takes four time steps of 0.02 seconds, or 0.08 seconds, to correctly
respond to the damage. Since the modeled A-4D is a fourth order
plant, this is consistent. From the plot, a spike in airspeed is
observed at the moment the damage is applied. Although the spike
appears large, the vertical scale is small; therefore, the spike is
insignificant. The spike is due to the fact that at t=5 seconds the
network is using only one damaged data point and 3 undamaged data
points to compute an output. Since all of the inputs to the network
were high before the damage occurred, with a sudden drop at t=5
seconds, the airspeed output spiked high. In a practical sense, if an
aircraft were to have a sudden decrease in angle of attack and pitch
rate, airspeed would increase. This is what the artificial neural
network has determined also. Since the network has determined
that pitch rate and angle of attack are high frequency modes of the
A-4D, sudden changes in these values override sudden changes in
airspeed and pitch angle, which are low frequency modes. The
network expects fast changes in pitch rate and angle of attack, and
responds to them, rather than to fast changes in pitch angle and
airspeed, which it does not expect. When the same data was run
with the damage occurring at t=6 seconds, there was a downward
spike in airspeed.
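
The 0.08 second figure follows from the four-sample input window: until all four lagged values come from the damaged plant, the network sees a mixture of old and new data. A minimal sketch of that counting argument, with illustrative names and assuming the window holds samples n through n-3:

```python
SAMPLE_PERIOD_S = 0.02
N_LAGS = 4


def damaged_points_in_window(n, damage_index, n_lags=N_LAGS):
    """Count the window samples n, n-1, ..., n-n_lags+1 taken at or after the damage."""
    return sum(1 for k in range(n_lags) if n - k >= damage_index)


damage_index = int(round(5.0 / SAMPLE_PERIOD_S))      # sample at t = 5 seconds
counts = [damaged_points_in_window(damage_index + i, damage_index)
          for i in range(5)]
# Number of time steps until the window contains only damaged-plant data.
steps_to_fill = next(i + 1 for i, c in enumerate(counts) if c == N_LAGS)
response_time_s = steps_to_fill * SAMPLE_PERIOD_S
```

At the damage instant the window holds one damaged and three undamaged points, matching the description above, and it is fully refreshed after four steps.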







Figure 4.21 Network Trained on 70% Damage and Undamaged A-4D;
Mδe Reduced 60% at t=5 (Time expanded)





V. CONCLUSIONS 


The designed artificial neural network was excellent at modeling
a damaged aircraft. It interpolated between a good aircraft and a
damaged aircraft, and it extrapolated from a single damage
parameter. Only one damaged aircraft model was required to
correctly model all of the damage mechanisms to Mδe, thus making
the network a robust model. It determined when damage occurred
automatically without any outside detection device and it responded
rapidly and correctly to the new aircraft dynamics with the
response time determined by the order of the plant and the sampling
period.

Further training of the network on damage mechanisms affecting
the rest of the A-4D’s aerodynamic derivatives could lead to a
network capable of responding to any possible aircraft damage, thus
making the neural network an ideal robust aircraft model.


APPENDIX A: MATLAB PROGRAMS 














%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Squashing.m
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% This program plots the two squashing functions, sigmoid and hyperbolic
% tangent, along with their derivatives.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

subplot(211)
x=-8:.1:8;
fx=1.0./(1+exp(-x));
plot(x,fx)
grid
xlabel('x')
ylabel('F(x)')
title('Sigmoid Function')

subplot(212)
x=-8:.1:8;
fx=exp(-x)./(1+exp(-x)).^2;
plot(x,fx)
grid
xlabel('x')
ylabel('F''(x)')
title('Derivative of Sigmoid Function')

print -deps sigmoid

subplot(211)
x=-4:.1:4;
fx=(exp(x)-exp(-x))./(exp(x)+exp(-x));
plot(x,fx)
grid
xlabel('x')
ylabel('F(x)')
title('Hyperbolic Tangent Function')

subplot(212)
x=-4:.1:4;
fx=1.0-((exp(x)-exp(-x))./(exp(x)+exp(-x))).^2;
plot(x,fx)
grid
xlabel('x')
ylabel('F''(x)')
title('Derivative of Hyperbolic Tangent Function')

print -deps hyperbolic


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% A4_cond3.m
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Data for A-4D flight condition 3
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Thetao=0;
UUo=0;
Alphao=0;
qo=0;

h=35000;
M=.7;
Q=171;
W=17578;
m=546;
Ix=8030;
Iy=25900;
Iz=29250;
Ixz=-891;
alphatrim=5.9*pi/180;
g=32.174;
U=681;
Xu=-.01521;
Xa=-16.68;
Xd=0;
Zu=-.1013;
Za=-309.17;
Zadot=-.613;
Zd=-33.3;
Zq=0;
Mu=.000542;
Ma=-7.423;
Madot=-.206;
Mq=-.592;
Md=-11.33;
Yb=-83.2;
Yp=0;
Yr=0;
Yda=-.979;
Ydr=14.11;
Lb=-21.35;
Lp=-.816;
Lr=.523;
Lda=11.74;
Ldr=4.73;
Nb=11.3;
Np=.00845;
Nr=-.321;
Nda=.0837;
Ndr=-4.69;


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% A4_matrix.m
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% This program was written to construct the State Space matrices for the
% longitudinal mode of the A4-D. Using the dimensional derivatives, the
% plant is constructed and digitized. States are: u, alpha, q, theta.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% A Matrix

kk=U-Zadot;
A=[Xu                   Xa/U           0                  -g*cos(Thetao)/U;
   U*Zu/kk              Za/kk          (U+Zq)/kk          -g*sin(Thetao)/kk;
   U*Mu+(Madot*U*Zu)/kk Ma+Madot*Za/kk Mq+Madot*(U+Zq)/kk 0;
   0                    0              1                  0];

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% B Matrix

B=[Xd/U Zd/kk Md+Madot*Zd/kk 0]';

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% C Matrix (identity: all four states are output, as in Chapter III)

C=[1 0 0 0;0 1 0 0;0 0 1 0;0 0 0 1];

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% D Matrix

D=zeros(size(B));

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Min sampling frequency = Nyquist freq (fn) = 2 x highest freq
% present. f=w/(2*pi); fn=2*w/(2*pi)=w/pi. For w=10,
% fs=10/pi. I will use a decade higher, max freq 100 rad/sec.

ts=.02;
fs=1/ts;

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Digitizing the Continuous Plant with sampling frequency of 50 Hz

[Ad,Bd,Cd,Dd]=bilinear(A,B,C,D,fs);


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% A4_bode.m
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% This program was written to plot the bode response for the
% longitudinal mode for the A4-D. The bode of the two plants are then
% compared.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
clear
clg

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Data

A4_cond3
A4_matrix

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Setting up the frequency range.

points=800;     % I have 200 pts/decade
w=logspace(-2,1,points);

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Check Frequency response of the A-4

subplot(221)
[mag phase]=bode(A,B,C(1,:),D(1),1,w);
semilogx(w,20*log10(mag)), grid
xlabel('Frequency (rad/sec)'),title('Bode Plot of u')
axis([.01,10,-40,40])

subplot(222)
[mag phase]=bode(A,B,C(2,:),D(2),1,w);
semilogx(w,20*log10(mag)), grid
xlabel('Frequency (rad/sec)'),title('Bode Plot of Alpha')
axis([.01,10,-40,40])

subplot(223)
[mag phase]=bode(A,B,C(3,:),D(3),1,w);
semilogx(w,20*log10(mag)), grid
xlabel('Frequency (rad/sec)'),title('Bode Plot of q')
axis([.01,10,-40,40])

subplot(224)
[mag phase]=bode(A,B,C(4,:),D(4),1,w);
semilogx(w,20*log10(mag)), grid
xlabel('Frequency (rad/sec)'),title('Bode Plot of Theta')
axis([.01,10,-40,40])

print -deps bode_A4

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Check Frequency response of the Continuous Plant and the Digitized Plant

subplot(221)
[mag phase]=bode(A,B,C(1,:),D(1),1,w);
[magd phased]=bode(A,B,C(1,:),D(1),1,w);
semilogx(w,20*log10(mag),'-',w,20*log10(magd),'-'), grid
xlabel('Frequency (rad/sec)'),title('Bode Plot of u')
axis([.01,10,-40,40])

subplot(222)
[mag phase]=bode(A,B,C(2,:),D(2),1,w);
[magd phased]=bode(A,B,C(2,:),D(2),1,w);
semilogx(w,20*log10(mag),'-',w,20*log10(magd),'-'), grid
xlabel('Frequency (rad/sec)'),title('Bode Plot of Alpha')
axis([.01,10,-40,40])

subplot(223)
[mag phase]=bode(A,B,C(3,:),D(3),1,w);
[magd phased]=bode(A,B,C(3,:),D(3),1,w);
semilogx(w,20*log10(mag),'-',w,20*log10(magd),'-'), grid
xlabel('Frequency (rad/sec)'),title('Bode Plot of q')
axis([.01,10,-40,40])

subplot(224)
[mag phase]=bode(A,B,C(4,:),D(4),1,w);
[magd phased]=bode(A,B,C(4,:),D(4),1,w);
semilogx(w,20*log10(mag),'-',w,20*log10(magd),'-'), grid
xlabel('Frequency (rad/sec)'),title('Bode Plot of Theta')
axis([.01,10,-40,40])

print -deps bode_both





%**********************************************************************
% Rand_seq.m
%**********************************************************************
% This program computes a random binary sequence with a frequency content of
% f_low to f_high (.001 to 5 Hz)
%**********************************************************************

% damp(eig(A))
%
%      Eigenvalue           Damping     Freq. (rad/sec)
%  -0.6245 + 2.6988i        0.2254        2.7701
%  -0.6245 - 2.6988i        0.2254        2.7701
%  -0.0088 + 0.0747i        0.1177        0.0752
%  -0.0088 - 0.0747i        0.1177        0.0752

f_high=5;       % 1 decade higher than highest freq of interest (.45 Hz)
f_low=.001;     % 1 decade lower than lowest freq of interest (.012 Hz)

ts=.02;
time=1/f_low;   % time is the period of the lowest freq, this sets the
                % minimum length of the random signal

points=2*time*f_high;

Rbnm=2*round(rand(points+1,1)')-1;  % creates a random binary sequence of +1 or -1

Rbnm=ones(5,1)*Rbnm;    % the random binary signal needs to be sampled at a
                        % faster rate than it was produced, therefore each
                        % value of Rbnm is duplicated by an amount equal to
                        % fs/(f_high*2) which must be a whole number, in
                        % this case 5.

Rbnm=Rbnm(:);           % converting the row matrix to a column matrix

t=0:ts:(points+1)/(2*5)-ts;
t=t';
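For readers without MATLAB, the construction used in Rand_seq.m can be sketched in Python. This is an illustrative translation and not part of the thesis code: the function name `random_binary_sequence` and its keyword arguments are assumptions, and Python's generator will not reproduce the number stream of MATLAB's `rand('seed',...)`.

```python
import random

def random_binary_sequence(f_low=0.001, f_high=5.0, ts=0.02, seed=1):
    """Return (t, rbnm): +/-1 values drawn at 2*f_high Hz and held at rate 1/ts."""
    rng = random.Random(seed)
    duration = 1.0 / f_low                          # period of the lowest frequency
    points = int(round(2 * duration * f_high))      # number of draws at 2*f_high Hz
    hold = int(round((1.0 / ts) / (2 * f_high)))    # fs/(f_high*2), 5 in the listing
    draws = [rng.choice((-1, 1)) for _ in range(points + 1)]
    rbnm = [d for d in draws for _ in range(hold)]  # repeat each draw `hold` times
    t = [k * ts for k in range(len(rbnm))]          # time stamp of every sample
    return t, rbnm
```

With the default arguments this yields 10001 draws held for 5 samples each, which matches the length of the time vector built in the listing above.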




%**********************************************************************
% Rand_plot.m
%**********************************************************************
% This program computes a random binary sequence with a frequency content of
% f_low to f_high and plots 3 seconds.
%**********************************************************************

axis([1,2,3,4]), axis

ts=.02;
f_high=5;       % 1 decade higher than highest freq of interest (.5 Hz)
f_low=.001;     % 1 decade lower than lowest freq of interest (.016 Hz)
time=1/f_low;   % time is the period of the lowest freq, this sets the
                % minimum length of the random signal

points=2*time*f_high;

Rbnm=2*round(rand(points+1,1)')-1;  % creates a random binary sequence of +1 or -1

Rbnm=ones(5,1)*Rbnm;    % the random binary signal needs to be sampled at a
                        % faster rate than it was produced, therefore each
                        % value of Rbnm is duplicated by an amount equal to
                        % fs/(f_high*2) which must be a whole number, in
                        % this case 5.

Rbnm=Rbnm(:);           % converting the row matrix to a column matrix

% Plotting the first three seconds of the random binary sequence

t=0:ts:3;
t=t';
Rbnm=Rbnm(1:length(t),1);

Displot(t,Rbnm),grid
axis([0,3,-2,2]);

print -deps RBN

axis




%**********************************************************************
% Rand_short_seq.m
%**********************************************************************
% This program computes 5 sec of a random binary sequence with a frequency
% content of f_low to f_high.
%**********************************************************************
ts=.02;

points=50;      % 5 sec
Rbnm=2*round(rand(points+1,1)')-1;  % creates a random binary sequence of +1 or -1

Rbnm=ones(5,1)*Rbnm;    % the random binary signal needs to be sampled at a
                        % faster rate than it was produced, therefore each
                        % value of Rbnm is duplicated by an amount equal to
                        % fs/(f_high*2) which must be a whole number, in
                        % this case 5.

Rbnm=Rbnm(:);           % converting the row matrix to a column matrix

t=0:ts:(points+1)/(2*5)-ts;
t=t';














% Spectrum_Rbnm.m
%
% Specplot2, plots the output of the SPECTRUM function along with the bode plot
% of the plant for comparison.
%
% SPECPLOT(P,Fs,state,A,B,C,D), uses:
%
%   P        the output of SPECTRUM,
%   state    the desired state (1,2,3 or 4)
%   Fs       the sample frequency,
%   A,B,C,D  State space representation of plant
%
% to plot:
%
%   1. Transfer Function Magnitude from Spectral Analysis
%   2. Transfer Function Magnitude from Bode of plant

[n,m]=size(P);
f = (1:n-1)/n*Fs/2;

plot(f,20*log10(abs(P(2:n,1)))),grid,xlabel('Frequency (Hz)'),
axis([.01,10,-40,40])

















%**********************************************************************
% Spectrum_NN_d.m
%**********************************************************************
% This program does the spectrum analysis of the output of the Neural Network.
%**********************************************************************

clear

A4_cond3
Md=.1*Md;
A4_matrix

load test.nnr
input=test(:,1);

u_out=test(:,25);
P=spectrum(input,u_out,25000);
Specplot1(P,50,1,A,B,C,D)
title('Bode Plot of u')

alpha_out=test(:,28);
P=spectrum(input,alpha_out,25000);
Specplot1(P,50,2,A,B,C,D)
title('Bode Plot of Alpha')

q_out=test(:,26);
P=spectrum(input,q_out,25000);
Specplot1(P,50,3,A,B,C,D)
title('Bode Plot of q')

theta_out=test(:,27);
P=spectrum(input,theta_out,25000);
Specplot1(P,50,4,A,B,C,D)
title('Bode Plot of Theta')

print -deps Md_1.eps










%**********************************************************************
% Gen_A4_data.m
%**********************************************************************
% This program calls sub-programs in order to generate the learning,
% testing and minmax file to be used by the neural network.
%**********************************************************************

clear
A4_cond3
A4_matrix
rand('seed',1)

%**********************************************************************
% Generate file "learn.nna" (learn data for NN)

Rand_seq
linsim('A4_diag',length(t)*ts);
Organize                % returns output_vector
minmaxlearn=[min(output_vector');max(output_vector')]';

fid=fopen('learn.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',output_vector)
fclose(fid)

%**********************************************************************
% Generate "test.nna" data (test data for NN)
rand('seed',0)

Rand_seq
linsim('A4_diag',length(t)*ts);
Organize
minmaxtest=[min(output_vector');max(output_vector')]';

fid=fopen('test.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',output_vector)
fclose(fid)

%**********************************************************************
% Generate "minmax" data (minmax data for NN)
minmax_A4=[min([minmaxlearn minmaxtest]');max([minmaxlearn minmaxtest]')]';

fid=fopen('minmax_A4.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',minmax_A4)
fclose(fid)











%**********************************************************************
% Gen_damaged_data.m
%**********************************************************************
% This program calls sub-programs in order to generate the learning,
% testing and minmax file to be used by the neural network for the damaged A4.
%**********************************************************************

clear
A4_cond3
rand('seed',1)

%**********************************************************************
% Generate file "learn_damaged.nna" (learn damaged data for NN)

Md=.3*Md;       % Make elevator 30% of original size. Elevator reduced by 70%
A4_matrix

Rand_seq
linsim('A4_diag',length(t)*ts);
Organize
minmaxlearn=[min(output_vector');max(output_vector')]';

fid=fopen('learn_damaged.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',output_vector)
fclose(fid)

%**********************************************************************
% Generate "test_damage.nna" data (test data for NN)
rand('seed',0)

Rand_seq
linsim('A4_diag',length(t)*ts);
Organize
minmaxtest=[min(output_vector');max(output_vector')]';

fid=fopen('test_damage.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',output_vector)
fclose(fid)

%**********************************************************************
% Generate "minmax" data (minmax data for NN)

minmax_damaged=[min([minmaxlearn minmaxtest]');max([minmaxlearn minmaxtest]')]';

fid=fopen('minmax_damaged.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',minmax_damaged)
fclose(fid)
















%**********************************************************************
% A4_step.m
%**********************************************************************
% This program was written to look at the step responses of all 4 states with different
% values of Md.
%**********************************************************************
A4_cond3 

A4_matrix 

T=1:1:400; 

[y1,x1,t1]=step(A,B,C,D,1,T); 


A4_cond3 

Md=.7*Md; 

A4_matrix 
[y2,x2,t2]=step(A,B,C,D,1,T); 


A4_cond3 

Md=.3*Md;

A4_matrix
[y3,x3,t3]=step(A,B,C,D,1,T);


subplot(221) 
plot(T,y1(:,1),'-',T,y2(:,1),':',T,y3(:,1),'-.'), ylabel('Amplitude')
xlabel('Time (sec)'),title('Step Response in Airspeed'), grid

subplot(222)
plot(T,y1(:,2),'-',T,y2(:,2),':',T,y3(:,2),'-.'), ylabel('Amplitude'),
xlabel('Time (sec)'),title('Step Response in Angle of Attack'),grid

subplot(223)


plot(T,y1(:,3),'-',T,y2(:,3),':',T,y3(:,3),'-.'), ylabel('Amplitude'),
xlabel('Time (sec)'),title('Step Response in Pitch Rate'), grid


subplot(224) 

plot(T,y1(:,4),'-',T,y2(:,4),':',T,y3(:,4),'-.'), ylabel('Amplitude'),
xlabel('Time (sec)'),title('Step Response in Pitch Angle'), grid

print step_plot -deps 


subplot(111) 













%**********************************************************************
% Organize.m
%**********************************************************************
% This program organizes the data and computes the output file to be used by
% the neural network.
%**********************************************************************

input_Rbnm=  [[Rbnm;0;0;0] [0;Rbnm;0;0] [0;0;Rbnm;0] [0;0;0;Rbnm]];
input_u=     [[u;0;0;0] [0;u;0;0] [0;0;u;0] [0;0;0;u]];
input_q=     [[q;0;0;0] [0;q;0;0] [0;0;q;0] [0;0;0;q]];
input_theta= [[theta;0;0;0] [0;theta;0;0] [0;0;theta;0] [0;0;0;theta]];
input_alpha= [[alpha;0;0;0] [0;alpha;0;0] [0;0;alpha;0] [0;0;0;alpha]];
output=      [[u;0;0;0] [q;0;0;0] [theta;0;0;0] [alpha;0;0;0]];

clear Rbnm
clear u
clear q
clear theta
clear alpha

output_vector=[[input_Rbnm],[input_u],[input_q],[input_theta],[input_alpha],[output]];
output_vector=output_vector(6:length(output_vector)-3,:)';

clear input_Rbnm
clear input_u
clear input_q
clear input_theta
clear input_alpha
clear output
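The record layout built by Organize.m can be sketched in Python as follows. This is an illustrative translation under stated assumptions, not the thesis code: the helper names `delayed_columns` and `organize` are hypothetical, and the row trimming assumes the signals are longer than 24 samples, so that MATLAB's `length` returns the row count. Each signal contributes four columns holding the signal delayed by 0, 1, 2 and 3 samples, and the last four columns hold the undelayed u, q, theta and alpha.

```python
def delayed_columns(signal, taps=4):
    """Row r, column k holds signal[r - k]; entries outside the signal are 0."""
    n = len(signal)
    return [[signal[r - k] if 0 <= r - k < n else 0 for k in range(taps)]
            for r in range(n + taps - 1)]

def organize(rbnm, u, q, theta, alpha, taps=4):
    """Build one 24-value record per time step, mirroring Organize.m."""
    n = len(rbnm)
    blocks = [delayed_columns(list(s), taps) for s in (rbnm, u, q, theta, alpha)]
    pad = [0] * (taps - 1)
    outputs = list(zip(list(u) + pad, list(q) + pad,
                       list(theta) + pad, list(alpha) + pad))
    table = [sum((b[r] for b in blocks), []) + list(outputs[r])
             for r in range(n + taps - 1)]
    # MATLAB keeps rows 6 .. length-3 (1-based), i.e. 0-based rows 5 .. n-1,
    # which drops the start-up rows and the zero-padded tail.
    return table[5:n]
```

For the 50005-sample sequences produced by Rand_seq.m this leaves 50000 records of 24 values each.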







%**********************************************************************
% Gen_both_data.m
%**********************************************************************
% This program loads the learn file generated by the undamaged plant and
% the learn file generated by the damaged plant and interleaves them.
%**********************************************************************
clear

load learn.nna
load learn_damaged.nna
load minmax_A4.nna
load minmax_damaged.nna

learn=learn';
learn_damaged=learn_damaged';

%**********************************************************************
% Generate file "learn_b.nna" (combined learning data for NN)

[m,n]=size(learn);
output_vector=zeros(m,2*n);
mix=20;

for I=1:mix;
output_vector(:,I:2*mix:2*n)=learn(:,I:mix:n);
output_vector(:,I+mix:2*mix:2*n)=learn_damaged(:,I:mix:n);
end

fid=fopen('learn_b.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',output_vector)
fclose(fid)

minmaxfiles=[minmax_A4;minmax_damaged];
minmax_b=[min(minmaxfiles) max(minmaxfiles)];

fid=fopen('minmax_b.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',minmax_b)
fclose(fid)
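The strided assignments in the loop above alternate blocks of `mix` records from the two files. A plain Python sketch of the same ordering is given below; the function name `interleave_blocks` is hypothetical, and the check that both inputs have the same length and are a multiple of `mix` is an added assumption that the MATLAB indexing relies on implicitly.

```python
def interleave_blocks(learn, learn_damaged, mix=20):
    """Alternate blocks of `mix` records: undamaged block, then damaged block."""
    if len(learn) != len(learn_damaged) or len(learn) % mix:
        raise ValueError("inputs must have equal length, a multiple of mix")
    combined = []
    for start in range(0, len(learn), mix):
        combined.extend(learn[start:start + mix])          # undamaged records
        combined.extend(learn_damaged[start:start + mix])  # damaged records
    return combined
```

With `mix=20` and a 50 Hz sample rate, the combined file switches between the undamaged and damaged plants every 0.4 seconds of simulated data.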










%**********************************************************************
% Specplot1.m
%**********************************************************************
% Specplot1, plots the output of the SPECTRUM function along with the bode plot
% of the plant for comparison.
%
% SPECPLOT1(P,Fs,state,A,B,C,D), uses:
%
%   P        the output of SPECTRUM,
%   state    the desired state (1,2,3 or 4)
%   Fs       the sample frequency,
%   A,B,C,D  State space representation of plant
%
% to plot:
%
%   1. Transfer Function Magnitude from Spectral Analysis.
%   2. Transfer Function Magnitude from Bode of plant
%**********************************************************************

function specplot1(P,Fs,state,A,B,C,D)

[n,m]=size(P);
f = (1:n-1)/n*Fs/2;

[mag phase w]=bode(A,B,C(state,:),D(state));
subplot(2,2,state)
semilogx(2*pi*f,20*log10(abs(P(2:n,4))),'-',w,20*log10(mag),'-'), grid
xlabel('Frequency (rad/sec)')
axis([.01,10,-40,40])




%**********************************************************************
% Specplot2.m
%**********************************************************************
% Specplot2, plots the output of the SPECTRUM function along with the bode plot
% of the plant for comparison.
%
% SPECPLOT2(P), uses:
%
%   P, the output of SPECTRUM
%
% to plot:
%
%   Transfer Function Magnitude from Spectral Analysis.
%**********************************************************************

function specplot2(P)
Fs=50;

[n,m]=size(P);
f = (1:n)/n*Fs/2;
f=f';
loglog(f,P(:,1)), grid
xlabel('Frequency (Hz)')
ylabel('Magnitude')
axis([.001,30,.0001,20])













%**********************************************************************
% Gen_response_data.m
%**********************************************************************
% This program calls sub-programs in order to generate 10 seconds of test data
% for plotting the output of the u
%**********************************************************************

% Generate 10 seconds of undamaged A4

clear
A4_cond3
A4_matrix
rand('seed',0)
%**********************************************************************
ts=.02;
points=100;     % 10 sec
Rbnm=2*round(rand(points+1,1)')-1;  % creates a random binary sequence of +1 or -1

Rbnm=ones(5,1)*Rbnm;    % the random binary signal needs to be
                        % sampled at a faster rate than it was
                        % produced, therefore each value of Rbnm
                        % is duplicated by an amount equal to
                        % fs/(f_high*2) which must be a whole
                        % number, in this case 5.

Rbnm=Rbnm(:);           % converting the row matrix to a column matrix

t=0:ts:(points+1)/(2*5)-ts;
t=t';

linsim('A4_diag',length(t)*ts);
Organize                % returns output_vector
switched=output_vector;

%**********************************************************************

fid=fopen('switched.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',switched)
fclose(fid)







%**********************************************************************
%
%**********************************************************************
% This program is used to generate the test files with various values of Md.
%**********************************************************************

clear
A4_cond3
rand('seed',0)

%**********************************************************************
percent = .1    % change .1 to any value from 1.0 to 0.1 as necessary
Md=percent*Md;

A4_matrix


Rand_seq
linsim('A4_diag',length(t)*ts);
Organize

fid=fopen('damaged1.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',output_vector)
fclose(fid)







%**********************************************************************
% Gen_switched.m
%**********************************************************************
% This program generates 5 seconds of undamaged A4 data followed by 5 seconds
% of damaged A4 data.
%**********************************************************************

clear
A4_cond3
A4_matrix
rand('seed',0)

Rand_short_seq
linsim('A4_diag',length(t)*ts);

Rbnm1=Rbnm;
u1=u;
q1=q;
theta1=theta;
alpha1=alpha;

Md=0.1*Md;
A4_matrix
Rand_short_seq
linsim('A4_diag',length(t)*ts);

Rbnm=[Rbnm1;Rbnm];
u=[u1;u];
q=[q1;q];
theta=[theta1;theta];
alpha=[alpha1;alpha];

Organize                % returns output_vector
switched=output_vector;

fid=fopen('switched.nna','w')
fprintf(fid,'%g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g %g\n',switched)
fclose(fid)







%**********************************************************************
% Switch_plot.m
%**********************************************************************
% This program plots the output of the neural network after 10 seconds of data
% have been supplied to the network.
%**********************************************************************
clear 

ts=.02; 

load switched.nnr; 


u_out=switched(1:500,25); 
alpha_out=switched(1:500,28); 
q_out=switched(1:500,26); 
theta_out=switched(1:500,27); 


U=switched(1:500,21); 
Alpha=switched(1:500,24); 
Q=switched(1:500,22); 
Theta=switched(1:500,23); 


T=0:ts:10-ts; 


subplot(221) 
plot(T,u_out,'-',T,U,'.'), ylabel('Amplitude')
xlabel('Time (sec)'),title('Airspeed'), grid

subplot(222)
plot(T,alpha_out,'-',T,Alpha,'.'), ylabel('Amplitude'),
xlabel('Time (sec)'),title('Angle of Attack'),grid

subplot(223)
plot(T,q_out,'-',T,Q,'.'), ylabel('Amplitude'),
xlabel('Time (sec)'),title('Pitch Rate'), grid

subplot(224)
plot(T,theta_out,'-',T,Theta,'.'), ylabel('Amplitude'),
xlabel('Time (sec)'),title('Pitch Angle'), grid


print -deps switched1 
subplot(111) 





APPENDIX B: NEURALWORKS SETUP SPECIFICS 


In order to provide a means of repeating the thesis research, the 
specific values used in the setup of NeuralWorks are described 
below. 

The learning rate coefficient was selected to be 0.3 for the
hidden layer and 0.15 for the output layer. The input layer is a
buffer and has no learning rate coefficient. The momentum was
selected to be 0.4, the learning rate coefficient ratio was selected
to be 0.5, the offset was chosen to be 0.1, and the transition point
was chosen to be 10,000. Although a transition point of 50,000
(the length of the data files) was initially chosen, the transition
point of 10,000 provided better results during learning. “Hyperbolic
tangent” was selected as the activation function (bipolar data), and
“delta rule” was selected as the Learn Rule. Although the presence
of noise during learning supposedly helps the network to generalize
to the test data, this did not prove to be the case. As a result, the
network was trained and tested without noise added by NeuralWorks.
An epoch of 1 was chosen and “fast learning” was also selected.
(Although epoch does not apply to the Delta Rule during learning, it
does apply to the output files or instruments created by
NeuralWorks; for example, an epoch of 5 and an input file 50,000
records long will generate an output file 10,000 records long. This
will cause the output data to be averaged over five inputs, resulting
in a loss of accuracy.)
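The effect of the epoch setting on the output file length can be illustrated with a short Python sketch. It assumes the instrument reports a simple mean over each group of `epoch` records; the text states only that the output is averaged, so the exact NeuralWorks computation is an assumption here, and the function name `epoch_average` is hypothetical.

```python
def epoch_average(values, epoch):
    """Average consecutive groups of `epoch` values; leftover values are dropped."""
    groups = len(values) // epoch
    return [sum(values[g * epoch:(g + 1) * epoch]) / epoch
            for g in range(groups)]

# 50,000 input records: epoch 5 leaves 10,000 averaged records,
# while epoch 1 keeps every record unchanged.
records = [float(k % 7) for k in range(50000)]
averaged = epoch_average(records, 5)
unchanged = epoch_average(records, 1)
```

Averaging over five samples smooths the recorded response, which is why an epoch of 1 was needed to compare the network output with the plant sample by sample.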





For “I/O” (input/output) parameters, “file sequence” was
selected for both the learn and test files. This setting requires the
data files to be read in sequential order, rewinding once the end of
the file is encountered. Although random file selection would appear
to have been a better choice, the network always saturated during
learning with that setting. This could have been because the input
file was longer than 32768 records. According to NeuralWorks, data
from files smaller than this size are automatically selected randomly
until all of the records are presented and then the file is rewound,
ensuring each data record is used the same number of times. Data
from files greater than this size have their records selected
randomly in a Gaussian distribution.

The minmax table is used by the network to normalize the data.
As mentioned earlier, the data sets generated by Matlab were too
large for NeuralWorks to automatically generate the required
minmax table, so a minmax table was generated using Matlab.
Selecting ±0.8 in “network ranges” scales the squashing function for
the network. Descaling of the output is done automatically. If the
data is not scaled properly, the network will saturate and learning
will cease.
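A minimal Python sketch of how a minmax table can be used to normalize records follows. It assumes a linear mapping of each field from its [min, max] interval onto the network range of -0.8 to +0.8; the text does not give the exact NeuralWorks formula, so the mapping and the function names `build_minmax`, `scale` and `descale` are assumptions made for illustration.

```python
def build_minmax(*datasets):
    """Column-wise (min, max) pairs over every record in every dataset."""
    columns = list(zip(*[row for data in datasets for row in data]))
    return [(min(c), max(c)) for c in columns]

def scale(record, minmax, lo=-0.8, hi=0.8):
    """Linearly map each field from [min, max] to [lo, hi]."""
    out = []
    for x, (mn, mx) in zip(record, minmax):
        # A constant column has no range; place it at the lower bound.
        out.append(lo if mx == mn else lo + (hi - lo) * (x - mn) / (mx - mn))
    return out

def descale(record, minmax, lo=-0.8, hi=0.8):
    """Inverse of scale(): map network-range values back to physical units."""
    return [mn + (mx - mn) * (y - lo) / (hi - lo)
            for y, (mn, mx) in zip(record, minmax)]
```

Building the table over both the learn and test sets, as the Matlab scripts in Appendix A do, keeps every test record inside the range the network saw during training.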





















INITIAL DISTRIBUTION LIST 


1. Defense Technical Information Center                2
   Cameron Station
   Alexandria, Virginia 22304-6145

2. Library, Code 52                                    2
   Naval Postgraduate School
   Monterey, California 93943-5100

3. Chairman, Code AA                                   4
   Department of Aeronautics and Astronautics
   Naval Postgraduate School
   Monterey, California 93943-5000

4. D.J. Collins, Code AA/Co                            2
   Department of Aeronautics and Astronautics
   Naval Postgraduate School
   Monterey, California 93943-5000

5. Richard M. Howard, Code AA/Ho                       1
   Department of Aeronautics and Astronautics
   Naval Postgraduate School
   Monterey, California 93943-5000

6. H. Titus, Code EC/Ts                                1
   Department of Aeronautics and Astronautics
   Naval Postgraduate School
   Monterey, California 93943-5000

7. LCDR(s) Clifford A. Brunger                         5
   6941 Enbome Lane
   San Diego, California 92139







